Transforming Examples into Patterns for Information Extraction
Abstract
Information Extraction (IE) systems today are commonly based on pattern matching. The patterns are regular expressions stored in a customizable knowledge base. Adapting an IE system to a new subject domain entails the construction of a new pattern base, a time-consuming and expensive task. We describe a strategy for building patterns from examples. To adapt the IE system to a new domain quickly, the user chooses a set of examples in a training text and, for each example, gives the logical form entries which the example induces. The system transforms these examples into patterns and then applies meta-rules to generalize these patterns.
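The abstract does not specify the pattern language or the meta-rules, so the following is only a minimal sketch of the general idea, under stated assumptions. It assumes an example is a short token sequence in which the user marks some tokens as slots of a logical form entry, that a pattern can be approximated by a Python regular expression, and that a meta-rule can be approximated by widening one literal word to a set of alternatives. The function names, slot names, and sample sentences are all hypothetical.

```python
import re


def example_to_pattern(tokens, slots):
    """Turn one annotated example into a regular expression.

    Tokens whose index appears in `slots` become named capture groups
    (standing in for logical form entries); all other tokens stay literal.
    """
    parts = []
    for i, tok in enumerate(tokens):
        if i in slots:
            parts.append(f"(?P<{slots[i]}>\\w+)")
        else:
            parts.append(re.escape(tok))
    return r"\s+".join(parts)


def generalize_verb(pattern, verb, alternatives):
    """Hypothetical meta-rule: widen one literal verb to several alternatives."""
    options = [verb, *alternatives]
    group = "(?:" + "|".join(re.escape(v) for v in options) + ")"
    return pattern.replace(re.escape(verb), group, 1)


def extract(pattern, text):
    """Return the filled slots for the first match, or None if nothing matches."""
    match = re.search(pattern, text)
    return match.groupdict() if match else None


# One user-chosen example with two slots (assumed slot names).
tokens = ["Smith", "succeeds", "Jones"]
slots = {0: "person_in", 2: "person_out"}

specific = example_to_pattern(tokens, slots)
general = generalize_verb(specific, "succeeds", ["replaces"])
```

In this sketch the pattern built directly from the example only matches sentences that use the same verb, while the generalized pattern also fills the slots for "Brown replaces Lee". That is the kind of coverage gain the generalization step is meant to provide, although the real system's meta-rules are presumably richer than a word-list substitution.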
Similar resources
Learning information extraction patterns from examples
A growing population of users want to extract a growing variety of information from on-line texts. Unfortunately, current information extraction systems typically require experts to hand-build dictionaries of extraction patterns for each new type of information to be extracted. This paper presents a system that can learn dictionaries of extraction patterns directly from user-provided examples o...
Feature selection using genetic algorithm for classification of schizophrenia using fMRI data
In this paper we propose a new method for classification of subjects into schizophrenia and control groups using functional magnetic resonance imaging (fMRI) data. In the preprocessing step, the number of fMRI time points is reduced using principal component analysis (PCA). Then, independent component analysis (ICA) is used for further data analysis. It estimates independent components (ICs) of...
A Logical Framework for Template Creation and Information Extraction
Information extraction is the process of automatically identifying facts of interest from pieces of text, and so transforming free text into a structured database. Past work has often been successful but ad hoc, and in this paper we propose a more formal basis from which to discuss information extraction. We introduce a framework which will allow researchers to compare their methods as well as ...
Extraction-Stripping Patterns during Co-Extraction of Copper and Nickel from Ammoniacal Solutions into Emulsion Liquid Membranes Using LIX 84I®
Extraction of nickel and its co-extraction with copper from ammoniacal media into emulsion liquid membrane systems (ELMs) was investigated using LIX 84I as the carrier. Measurement of the solute stripped in the internal phase of emulsion opened a new dimension in the study of the ELM extraction processes. The effect of operating parameters such as feed pH, initial feed concentration, and treat ...
Publication date: 1998